On scaling up sensitive data auditing
Similar resources
On Scaling Up Sensitive Data Auditing
This paper studies the following problem: given (1) a query and (2) a set of sensitive records, find the subset of records “accessed” by the query. The notion of a query accessing a single record is adopted from prior work. There are several scenarios where the number of sensitive records is large (in the millions). The novel challenge addressed in this work is to develop a general-purpose solu...
Scaling Up Context-Sensitive Text Correction
The main challenge in building a realistic system with context-sensitive inference capabilities, beyond accuracy, is scalability. This paper studies this problem in the context of a learning-based approach to context-sensitive text correction: the task of fixing spelling errors that result in valid words, such as substituting to for too, casual for causal, and so on. Research papers ...
Quickstep: A Data Platform Based on the Scaling-Up Approach
Modern servers pack storage and computing power that just a decade ago was spread across a modest-sized cluster. This paper presents a prototype system, called Quickstep, to exploit the large amount of parallelism that is packed inside such modern servers. Quickstep builds on a vast body of previous work on methods for organizing data, optimizing, scheduling and executing queries, and bri...
Scaling-up Split-Merge MCMC with Locality Sensitive Sampling (LSS)
Split-Merge MCMC (Markov Chain Monte Carlo) is one of the essential and popular variants of MCMC for problems where an MCMC state consists of an unknown number of components. It is well known that state-of-the-art methods for split-merge MCMC do not scale well. Strategies for rapid mixing require smart and informative proposals to reduce the rejection rate. However, all known smart proposals in...
Scaling Up Question-Answering to Linked Data
Linked Data semantic sources, in particular DBpedia, can be used to answer many user queries. PowerAqua is an open multi-ontology Question Answering (QA) system for the Semantic Web (SW). However, the emergence of Linked Data, characterized by its openness, heterogeneity and scale, introduces a new dimension to the Semantic Web scenario, in which exploiting the relevant information to extract a...
Journal
Journal title: Proceedings of the VLDB Endowment
Year: 2013
ISSN: 2150-8097
DOI: 10.14778/2535573.2488338